Audio-Visual Speech Cue Combination

نویسندگان
چکیده

منابع مشابه

Audio-Visual Speech Cue Combination

BACKGROUND Different sources of sensory information can interact, often shaping what we think we have seen or heard. This can enhance the precision of perceptual decisions relative to those made on the basis of a single source of information. From a computational perspective, there are multiple reasons why this might happen, and each predicts a different degree of enhanced precision. Relatively...

متن کامل

Audio-visual speech recognition is consistent with Bayesian optimal cue combination

In the AV* condition, we intended to decrease the amount of information provided by the visual stimulus while preserving as much as possible the appearance of the talking face (see Figure S1). A natural appearance of the visual stimulus has been found to be important for effective AV fusion of speech (Schwartz JL et al., 2004). To this end, we used an “Active Appearance Model” (AAM) – a compute...

متن کامل

Continuous Audio-visual Speech Recognition Continuous Audio-visual Speech Recognition

We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal model...

متن کامل

Expressive audio-visual speech

We aim at the realization of an Embodied Conversational Agent able to interact naturally and emotionally with user. In particular, the agent should behave expressively. Specifying for a given emotion, its corresponding facial expression will not produce the sensation of expressivity. To do so, one needs to specify parameters such as intensity, tension, movement property. Moreover, emotion affec...

متن کامل

Audio Visual Speech Enhancement

This thesis presents a novel approach to speech enhancement by exploiting the bimodality of speech production and the correlation that exists between audio and visual speech information. An analysis into the correlation of a range of audio and visual features reveals significant correlation to exist between visual speech features and audio filterbank features. The amount of correlation was also...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: PLoS ONE

سال: 2010

ISSN: 1932-6203

DOI: 10.1371/journal.pone.0010217